# Image Classification Backbone

Focalnet Huge Fl4.ms In22k

FocalNet is an image classification model built on focal modulation networks, pretrained by Microsoft on the ImageNet-22k dataset.

Image Classification

Swinv2 Base Patch4 Window12to24 192to384 22kto1k Ft

Swin Transformer v2 is a vision Transformer model that handles image classification and dense recognition tasks efficiently through hierarchical feature maps and local window self-attention.

Image Classification
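
The model names in this list pack several settings into one string. The sketch below is one possible way to split such a name into labelled fields. The function `decode_name` and its field names are hypothetical, and the reading of `AtoB` as "pretrained at A, then fine-tuned at B" is an assumption based on the usual Swin naming convention, not something this page states.

```python
import re

VARIANTS = {"tiny", "small", "base", "large", "huge"}

def decode_name(name):
    """Heuristically split a catalog model name into labelled fields.

    Unrecognised tokens (e.g. 'swinv2', 'ft', 'fl4.ms') are ignored.
    """
    fields = {}
    for token in name.lower().split():
        if token in VARIANTS:
            fields["variant"] = token
        elif m := re.fullmatch(r"patch(\d+)", token):
            fields["patch_size"] = int(m.group(1))
        elif m := re.fullmatch(r"window(\d+)(?:to(\d+))?", token):
            # One value, or (pretrain, fine-tune) when written as 'AtoB'.
            fields["window_size"] = tuple(int(g) for g in m.groups() if g)
        elif m := re.fullmatch(r"(\d+)(?:to(\d+))?", token):
            fields["image_size"] = tuple(int(g) for g in m.groups() if g)
        elif m := re.fullmatch(r"(?:in)?(\d+k)(?:to(\d+k))?", token):
            fields["dataset"] = tuple(g for g in m.groups() if g)
    return fields

if __name__ == "__main__":
    print(decode_name("Swinv2 Base Patch4 Window12to24 192to384 22kto1k Ft"))
```

Under that assumption, the entry above would read as a Base variant with 4x4 patches, moved from window 12 at 192 px (ImageNet-22k) to window 24 at 384 px (ImageNet-1k).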

Swinv2 Base Patch4 Window12 192 22k

Swin Transformer v2 is a vision Transformer model that achieves efficient image processing through hierarchical feature maps and local window self-attention mechanisms.

Image Classification

Swinv2 Small Patch4 Window16 256

Swin Transformer v2 is a vision Transformer model that achieves efficient image processing through hierarchical feature maps and local window self-attention mechanisms.

Image Classification

Swinv2 Tiny Patch4 Window16 256

Swin Transformer v2 is a vision Transformer model that achieves efficient image classification through hierarchical feature maps and local window self-attention mechanisms.

Image Classification

Swin Small Patch4 Window7 224

Swin Transformer is a hierarchical, window-based vision Transformer model designed for image classification tasks, with computational complexity that is linear in the input image size.

Image Classification

Swin Tiny Patch4 Window7 224

Swin Transformer is a hierarchical vision Transformer that achieves linear computational complexity by computing self-attention within local windows, making it suitable for image classification tasks.

Image Classification
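
To make the linear-complexity claim concrete, the sketch below compares the cost estimates given in the Swin Transformer paper for global self-attention, 4hwC² + 2(hw)²C, and for window self-attention with M x M windows, 4hwC² + 2M²hwC. The function names are hypothetical, and the channel width C = 96 is an assumed value used only for illustration. The patch size 4 and window size 7 are taken from the model name.

```python
def token_grid(image_size, patch_size=4):
    """Side length of the token grid after patch embedding."""
    return image_size // patch_size

def msa_cost(h, w, c):
    """Global self-attention cost over an h x w token grid: quadratic in h*w."""
    n = h * w
    return 4 * n * c**2 + 2 * n**2 * c

def wmsa_cost(h, w, c, m):
    """Window self-attention cost with m x m windows: linear in h*w."""
    n = h * w
    return 4 * n * c**2 + 2 * m**2 * n * c

if __name__ == "__main__":
    C, M = 96, 7  # C is an assumed channel width; M comes from 'Window7'
    for size in (224, 448):
        g = token_grid(size)
        ratio = msa_cost(g, g, C) / wmsa_cost(g, g, C, M)
        print(f"{size}px: {g}x{g} tokens, global/window cost ratio = {ratio:.1f}")
```

Doubling the image side quadruples the token count. The windowed cost then grows by exactly 4x, while the global cost grows by more than 4x because of its (hw)² term.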

Swin Base Patch4 Window7 224

Swin Transformer is a hierarchical vision transformer based on shifted windows, suitable for image classification tasks.

Image Classification

Swin Large Patch4 Window7 224

Swin Transformer is a hierarchical vision Transformer that achieves linear computational complexity by computing self-attention within local windows, making it suitable for image classification and dense recognition tasks.

Image Classification

Swin Large Patch4 Window7 224 In22k

Swin Transformer is a hierarchical vision Transformer based on shifted windows, pretrained on the ImageNet-22k dataset (also referred to as ImageNet-21k), suitable for image classification tasks.

Image Classification

Swin Base Patch4 Window12 384

Swin Transformer is a hierarchical vision Transformer based on shifted windows, designed for image classification tasks, with computational complexity that is linear in the input image size.

Image Classification

© 2025 AIbase